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Abstract 

We present a novel synthesis of Fisher information and asset pric- 
ing theory that yields a practical method for reconstructing the prob- 
ability density implicit in security prices. The Fisher information ap- 
proach to these inverse problems transforms the search for a proba- 
bility density into the solution of a differential equation for which a 
substantial collection of numerical methods exist. We illustrate the 
potential of this approach by calculating the probability density im- 
plicit in both bond and option prices. Comparing the results of this 
approach with those obtained using maximum entropy we find that 
Fisher information usually results in probability densities that are 
smoother than those obtained using maximum entropy. 

1 Introduction 

Probability laws that derive from a variational principle provide an opera- 
tional calculus for the incorporation of new knowledge. This feature, long 
exploited in the physical sciences, is becoming increasingly popular in fi- 
nance and economics where, for example, maximum entropy has found ap- 
plication as a useful and practical computational approach to financial eco- 
nomics (Maasoumi 1993, Sengupta 1993, Golan, Miller & Judge 1996, Fomby 
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& Hill 1997). It is generally felt that a candidate probability law p(x) should 
be as non-informative and smooth (in some sense) as possible while maintain- 
ing consistency with the known information about x. While this criterion has 
often been used to motivate the use of maximum entropy (Buck & Macaulay 
1990), other variational approaches provide similar - and potentially superior 

- degrees of smoothness to probability laws (Frieden 1988, Edelman 2001) and 
it is the purpose of this paper to show that Fisher information (Frieden 1998) 

- which provides just such a variational approach - can be used to reconstruct 
probability densities of interest in financial economics. In particular, Fisher 
information provides a variational calculus, a well developed computational 
approach to the estimation of probability laws and yields a probability law 
where smoothness is ensured across the range of support; in contrast to max- 
imum entropy where smoothness tends to be concentrated in regions where 
the probability density is very small. 

Since maximum entropy is comparatively well known in financial eco- 
nomics and as it shares with Fisher information a common variational struc- 
ture, we shall use it as a point of comparison when we review Fisher informa- 
tion in Sec. 121 In that section we shall see that Fisher information yields not 
an analytic expression for the probability law, but rather, a differential equa- 
tion for the probability law that is well known in the physical sciences: the 
Schroedinger equation. Using this fortunate correspondence we can exploit 
decades of development of computational approaches to the Schroedinger 
equation in the construction of probability laws in finance and economics. 

Like other variational approaches the Fisher information approach de- 
pends upon prior information in the form of equality constraints, and we 
shall explore this in Sec. HU where examples of increasingly complex structure 
will be given. In particular, exploiting the correspondence between (i) the 
resulting differential equations for probability laws in these examples from 
financial economics and (ii) the Schroedinger equation, will permit the gen- 
eration both of yield curves from observed bond prices and of probability 
densities from observed option prices in new and efficient ways. In addition, 
we shall see that the probability densities generated using Fisher informa- 
tion are, in general, smoother than those obtained from maximum entropy. 
A brief summary of our results is given in Sec. 0] which is followed by two 
appendices where details of the calculations are presented in somewhat ex- 
panded form. 
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2 Theory 



When observed data are expectation values, their relationship to the proba- 
bility density is that of a linear integral, and the calculation of the probability 
density from these observations is a linear inverse problem. 

In the most general case we have M observed data values d\, . . . , du = 
{dm} that are known to be averages of known functions {f m ( x )} and are 
related to a probability density function p(x) by 

•b 

f m (x)p(x)dx = d m , m = 1, . . . , M . (1) 

As there are an infinite number of probability densities that satisfy Eq. ^ 
a regularizer is commonly introduced to impose a structural constraint on 
the characteristics of p(x) and to choose a particular density This can be 
accomplished by constructing a Lagrangian and employing the variational 
calculus. In the case of maximum entropy, the regularizer is the Shannon 
entropy (Shannon 1948) 

-b 



H 



p(x) In [p{x)} dx 



with which one can form the Lagrangian 



M 



p(x) In [p{x)\ dx + A„ 



m=l 



f m (x)p(x)dx - d ri 



(2) 



(3) 



where the {A m } are the Lagrange undetermined multipliers. For this La- 
grangian the extremum solution with zero first variation is known to be ( Jaynes 
1968) 
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where 
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dx 



(5) 



By comparison, in most uses of the Fisher information approach the Shannon 
entropy is simply replaced with Fisher information (Fisher 1925) in its shift- 
invariant form 

" b p'(x) 2 



-dx 



(6) 
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where the prime denotes differentiation with respect to x. Consequently the 
Lagrangian becomes 



f b p'(x) 2 M r f b 

/ 7 \ dx + 52 Xm / /' 



(x)p(x)dx — d r , 



(7) 



As before we seek an extremum solution with zero first variation. This time, 
however, the Euler-Lagrange equations result in a differential equation for 
p(x): 



dx 



p'(x) 
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(8) 



m=l 



This equation can be simplified through the use of a probability amplitude 
function q(x) where q 2 (x) = p(x). Substituting the probability amplitude 
into Eq. |H] yields the expression 



d 2 q(x) 
dx 2 



4 



M 



5 j ^rnfv 



X 



(9) 



m=l 



known in the physical sciences as the Schroedinger equation. 

Thus we see that, although the use of Fisher information results in a 
differential equation for p(x) instead of an algebraic function, the resulting 
differential equation for q(x) is one for which a number of analytic solutions 
are known and for which a substantial collection of numerical solutions exist. 
Furthermore, as we shall see in the next section, the Fisher information solu- 
tions are, in general, smoother than those obtained using maximum entropy. 



3 Examples 

In this section we use the Fisher information approach to motivate practi- 
cal computational approaches to the extraction of probability densities and 
related quantities from financial observables. In each case we contrast the 
results obtained by this method with those of maximum entropy. We begin 
in Sees. 13.11 and 13.21 with examples from fixed income where observables are 
expressible in terms of partial integrals of a probability density and of first 
moments of the density. In Sees. 13 .31 and 13 .41 we finish the examples by exam- 
ining the canonical problem of extracting risk-neutral densities from observed 
option prices and extend this analysis by showing how Fisher information can 
be used to calculate a generalized implied volatility. 
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3.1 The Probability Density in the Term Structure of 
Interest Rates: Yield Curve Construction 



A basis for the use of information theory in fixed-income analysis comes from 
the perspective developed by Brody & Hughston (2001) who observed that 
the price of a zero-coupon bond - also known as the discount factor D(t) - can 
be viewed as a complementary probability distribution where the maturity 
date is taken to be an abstract random variable. The associated probability 
density p(t) satisfies p(t) > for all t, and 



D(t) = J p(r)dT, (10) 

POO 

= / Q{T-t)p{r)dr , (11) 

JO 

where Q(x) is the Heaviside step function 1 . 

To illustrate the differences between the Fisher information and maximum 
entropy approaches to term structure estimation we consider the case of a 
single zero coupon bond of tenor T = 10 years and a price of 28 cents on the 
dollar or, equivalently, a discount factor of 0.28 2 . The maximum entropy 
solution, Eq. HI is 

e -Ae(t-r) 

P»W = TTT/I , (12) 

and the corresponding discount factor is D(T) = I /(XT + 1) from which the 
Lagrange multiplier is found to be 0.2571. 

The Fisher information solution is obtained by solving Eq. |U] with the 
constraints of normalization and one discount factor: 

m = f [Xo + XM t- m , (13) 

This problem is known 3 to have the solution 

q W-\Be-* ift>T, (i4j 

where the coefficients A and B are determined by requiring that q(t) and 
q'(t) (or the logarithmic derivative q'(t)/q(t)) match at t — T. Matching the 
logarithmic derivative yields 

tan(aT) = (3/ a . (15) 
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Choosing aT = 7r/4 and matching amplitudes we find that B = A exp (vr/4) / \^2, 



8a cos (at) f t ^ T 

/'//<') = < ,Jtt^-m " I (16) 

7T+4 ' 

and D(T) = 2/(vr + 4) = 0.28. 

The maximum entropy and Fisher information probability densities (Eqs. 
^]andEH r espectively) and the corresponding term structure of interest rates 
are illustrated in Fig. ^ In the uppermost panel we see both probability den- 
sities as a function of tenor with the Fisher information result denoted in all 
panels by the solid line. As expected, the maximum entropy result is uni- 
form until the first (and in this case only) observation is encountered; beyond 
which a decaying exponential is observed. The Fisher information result is 
smoother reflecting the need to match both the amplitude and derivative at 
the data point. 

Looking at the probability density is difficult to make any real aesthetic 
choice between the two results. For this we turn to the two important derived 
quantities: the spot rate r(t) and the forward rate fit) that are related to 
the discount factor by 

D(t) = e~ r(t) \ (17) 
= e/o'/W*. (is) 

The spot rates are shown in the middle panel of Fig. ^ Both methods yield 
a smooth result with the Fisher information solution showing less structure 
than the maximum entropy solution. A greater difference between the two 
methods is seen in the lowermost panel of Fig. ^ where the forward rate 
is shown. It is the structure of this function that is often looked to when 
assessing the relative merits of a particular representation of the discount 
factor. The forward rate reflects the structure of the probability density as 
expected from the relationship 

f(t)=p(t)/D(t), (19) 

with the maximum entropy result showing more structure than the Fisher 
information result due to the continuity at the level of the first derivative 
imposed on p(t) by the Fisher information approach. 
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Figure 1: The probability density and derived interest rates for a single ten 
year discount factor of 0.28 
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It is a comparatively straightforward matter to extend this approach to 
the construction of a term structure that is consistent with any number of ar- 
bitrarily spaced zero coupon bonds. A particularly convenient computational 
approach based on transfer matrices is presented in Appendix A. 

While previous work on inferring the term structure of interest rates from 
observed bond prices has usually focused on the somewhat ad hoc applica- 
tion of splines to the spot or forward rates (McCullough 1975, Vasicek & 
Fong 1982, Shea 1985, Fisher, Nychken & Zervos 1995), the work of Frish- 
ling & Yamamura (1996) that minimized df(t)/dt is similar in spirit to the 
Fisher information approach. Their paper dealt with the often unacceptable 
results that a straightforward application of splines to this problem of in- 
ference can produce. In some sense, Fisher information can be seen as an 
information-theoretic approach to imposing the structure sought by Frishling 
and Yamamura on the term structure of interest rates. 4 

3.2 The Probability Density in a Perpetual Annuity 

Material differences between the probability densities generated by Fisher 
information and maximum entropy are also seen when the observed data are 
moments of the density - a common situation in financial applications. The 
value of a perpetual annuity £, for example, is given by (Brody & Hughston 
2001, Brody & Hughston 2002) 



This first-moment constraint provides an interesting point of comparison for 
the maximum entropy approach employed by Brody and Houghston and our 
Fisher information approach. The maximum entropy solution is known to 
be 



while the Fisher information solution is known (Frieden 1988) to be 



where Ai(x) is Airy's function and the constants q are determined uniquely 
by normalization and by the constraint of Eq. 1201 We show these two prob- 
ability densities as a function of tenor for a perpetual annuity with a price 




(20) 




(21) 



p F i(t) = ciAi 2 (c 2 t) , 



(22) 
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Figure 2: The probability density functions associated with a perpetual an- 
nuity with a price of 1.0. 
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of $1.00 in Fig. |21 The two solutions are qualitatively similar in appearance; 
both monotonically decrease with tenor and are quite smooth. However, 
since the Fisher information solution starts out at zero tenor with a lower 
value than the maximum entropy solution and then crosses the maximum 
entropy solution so as to fall off more slowly than maximum entropy solution 
in the mid-range region, the Fisher information solution seems to be even 
smoother. In the context of Fisher information, a measure of smoothness is 
the size of the Cramer-Rao bound (CRB) 1/1 (Frieden 1988) that, for the 
Fisher information solution is found to be 1.308 by integrating Eq. |BJ The 
CRB for the maximum entropy solution is 1.000. Hence, by the criterion of 
maximum Cramer- Rao bound, the Fisher information solution is significantly 
smoother. We also compared the relative smoothness of these solutions using 
Shannon's entropy, H, (Eq. |2J as a measure. The maximum entropy solu- 
tion is found to have an H of 1.0, while the Fisher information solution has 
an H of 0.993. Hence the maximum entropy solution does indeed have a 
larger Shannon entropy than does the Fisher information solution, but it is 
certainly not much larger. Since the two solutions differ much more in their 
CRB values than in their Shannon H values, it appears that the CRB is a 
more sensitive measure of smoothness, and hence biasedness, of a probability 
law. 

An interesting result emerges from an examination of the smoothness as 
a function of range. Since over the range < T < 5 shown in Fig. |21 it ap- 
pears that the Fisher information ought to be measurably smoother than the 
maximum entropy solution by any criterion, we also computed the Shannon 
H values over this interval alone. The results are H = 0.959 for maximum 
entropy and H = 0.978 for Fisher information: the Fisher information solu- 
tion has the larger entropy. This shows that the Fisher information solution 
is smoother than the maximum entropy solution, even by the maximum en- 
tropy criterion, over this finite interval. It follows that the maximum entropy 
solution has the larger entropy overall only due to its action in the long tail 
region T > 5. Note, however, that in this region the maximum entropy 
solution would have negligibly small values for most purposes. 

Hence, the indications are that the criterion of maximum entropy places 
an unduly strong weight on the behavior of the probability density in the tail 
regions. Note by comparison that over the interval < T < 5 the CRB values 
for the two solutions still strongly differ, with 1.317 for the Fisher information 
solution and 1.007 for the maximum entropy. Moreover, these are close to 
their values as previously computed over the entire range < T < oo. Hence, 
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Fisher information gives comparatively lower weight to the tail regions of the 
probability densities. 

3.3 The Probability Density in Option Prices 

While the reconstruction of probability densities from observed option prices 
has been the topic of much research (c.f. (Jackwerth 1999) and references 
therein), the expectation of global smoothness in the resulting probability 
densities has often not been achieved through the use of techniques that 
focus globally (e.g. maximum entropy (Stutzer 1994, Hawkins, Rubinstein & 
Daniell 1996, Buchen & Kelley 1996, Stutzer 1996, Hawkins 1997, Avellaneda 
1998)). As with term-structure estimation, various approaches to smoothing 
probability densities have been often introduced in an ad hoc manner. Fisher 
information, however, provides a natural manner for the introduction of a 
measure of local smoothness into the option-based density reconstruction 
problem. 

For purposes of illustration, let us consider the expression for the price of 
a European call option c(k): 



where k is the strike price, t is the time to expiration, and r is the risk-free 
interest rate. We take c(k), k, r, and t to be known and ask what p(x) is 
associated with the observed c(k). The observed call value c(k) can also be 
viewed as the mean value of the function e" ri max(x — k, 0), and it is from 
this viewpoint that a natural connection between option pricing theory and 
information theory can be made. 

Given a set of M observed call prices {c(k m )} the Fisher Information 
solution is obtained by solving Eq. |§] subject to these constraints and nor- 
malization, 



While the numerical method described in Appendix A can be applied to 
Eq. |2U by replacing the terms multiplying q(t) on the right-hand side of 
Eq. with a stepwise approximation or by treating the solution between 




(23) 




(24) 
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strike prices as a linear combination of Airy functions, Eq. |^ can be inte- 
grated directly using the Numerov method (cf. Appendix B). 5 The maximum 
entropy solution can be obtained in a straightforward manner substituting 
Eq. |U into Eq. |2H1 and varying the {A m } to reproduce the observed option 
prices. 

We applied the Fisher information approach to the example discussed 
in Edelman (2001) of 3 call options on a stock with an annualized volatility 
a of 20% and a time to expiration t of 1 year. In this example the interest 
rate r is set to zero, the stock price S t is $1.00 and the three option prices 
with strike price k of $0.95, $1.00, and $1.05 have Black-Scholes prices $0,105, 
$0,080, and $0,059 respectively. We note that the stock can also be viewed in 
this example as a call option with a strike price of $0.00. Given these 'obser- 
vations' together with the normalization requirement on p(x) we generated 
the solutions shown in Fig. El The smooth dash-dot curve is the lognormal 
distribution with which the 'observed' option prices were sampled 6 , and is 
the distribution we are trying to recover from the option prices. The Fisher 
information result is given by the solid curve and the MaxEnt result is the 
dashed curve that was calibrated to the observed option prices. 

We began by considering the case of the probability density implicit in a 
single at-the-money (i.e. k = $1.00) option price and the results are shown 
in the upper panel. The maximum entropy solution is a sharply peaked 
product of two exponentials. In contrast the Fisher information solution 
has, by virtue of requiring continuity of the first derivative, a much smoother 
appearance and is much closer in appearance to the lognormal density from 
which the option price was generated. We do, however, see that with only a 
single option price the asymmetric features of the lognormal density are not 
recovered. 

Adding the option prices at the k = $0.95 and k = $1.05 (from Edelman 
example) strikes to the problem results in the densities shown in the lower 
panel of Fig. El With this information the agreement between the Fisher 
information solution and the lognormal density is improved substantially 
with the peak of the density functions nearly the same and the characteris- 
tic asymmetry of the lognormal density now seen in the Fisher information 
result.. The maximum entropy solution continues to be sharply peaked and 
to deviate substantially from the lognormal density. Sharply peaked densi- 
ties have appeared in previous work involving limited financial observables 
(see, for example, (Kuwahara & Marsh 1994)) and this feature should not 
be interpreted as a specific indictment of the maximum entropy approach. 
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Figure 3: Probability density functions associated with option prices. 
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In previous work (Hawkins et al. 1996) we generated smooth implied den- 
sities from S&P-500 index options by assuming a lognormal prior based on 
the at-the- money-option {k — 1) option instead of the uniform prior as- 
sumed in Eq. |2] and forming a Lagrangian using the Kullback-Leibler en- 
tropy (Kullback 1959) 



G 



p(x) In 



p(x) 
r(x) 



dx 



(25) 



where r(x) is the prior - in our case lognormal - density. This points to one 
of the key differences between the maximum entropy and Fisher information 
approached: needed assumptions. When observations are limited - as they 
often are in financial economics applications - smooth maximum entropy 
densities often require a prior density: Fisher information simply imposes a 
continuity constraint. 



3.4 A Measure of Volatility 

In addition to providing a way of calculating the density implied by option 
prices, Fisher information yields, via the Cramer-Rao bound, a simultaneous 
measurement of the intrinsic uncertainty in this system. This uncertainty 
principle is, perhaps, most easily seen if we recall that Fisher information is 
generally presented in terms of a system specified by a parameter 9 with a 
probability density p(x\9) where 



p(x\9)dx = 1 



(26) 



and the Fisher information is 



p(x\9) 



dp{x\9)/d9 
p(x\9) 



dx 



(27) 



For distributions that can be parameterized by a location parameter 9, Fisher 
information can then be defined with respect to the family of densities 
{p(x — 9)} (cf. (Cover & Thomas 1991)). This transformation leads directly 
to Eq. El for the Fisher information. It also implies that the mean-square 
error e 2 in the estimate of that location parameter is related to this Fisher 
information, /, by the Cramer- Rao inequality e 2 J > 1. If, for example, p(x) 
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is a Gaussian, the minimum Cramer- Rao bound is / = l/o -2 with a 2 being 
the variance. Given that the probability density reconstructed from option 
prices is the conditional density of underlying asset levels at option expira- 
tion and given that a set of option prices is a measurement of the probability 
density it follows that the Fisher information, via the Cramer-Rao bound, 
provides a measure of the mean-square error in the estimate of the level of 
the underlying asset at expiration provided by the option prices and, thus, 
a natural generalization of the concept of implied volatility to non-Gaussian 
underlying-asset distributions. 

4 Discussion and Summary 

Fisher information provides a variational approach to the estimation of prob- 
ability laws that are consistent with financial observables. Using this ap- 
proach one can employ well developed computational approaches from for- 
mally identical problems in the physical sciences. The resulting probability 
laws posses a degree of smoothness consistent with priors concerning the 
probability density and/or quantities derived therefrom. In this paper we 
have shown how Fisher information can be used to solve some canonical 
inverse problems in financial economics and that the resulting probability 
densities are generally smoother than those obtained by other methods such 
as maximum entropy. Fisher information also has the virtue of providing 
a natural measure of, and computation approach for determining, implied 
volatility via the Cramer-Rao inequality. 

Comparing the maximum entropy approach to that of Fisher information 
it was seen that the former has the virtue of simplicity, always being an ex- 
ponential. The Fisher information solution is always a differential equation 
for the probability density. Although somewhat more complicated, this is, in 
some sense, a virtue as the probability densities of physics and, presumably 
of finance and economics, generally obey differential equations. Indeed, it is 
in this application that we see the greatest potential for future applications of 
Fisher information in finance and economics. For example, in addition to the 
problems examined in Sec. E3 recent developments in macroeconomic model- 
ing have focused upon placing all or nearly all heterogeneous microeconomic 
agents within one stochastic and dynamic framework. This has the effect 
of replacing the replicator dynamics of evolutionary games or the Malthu- 
sian dynamics of Friedman with the backward Chapman-Kolmogorov equa- 
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tion (Aoki 1998, Aoki 2001). Such a statistical mechanical representation of 
macroeconomics provides a natural link to Fisher information. The solutions 
of the Schroedinger equation (Eq. |UJ) are known to be stationary solutions 
to the Fokker-Plank equation (Gardiner 1996, Risken 1996). More gener- 
ally, however, the entire Legendre-transform structure of thermodynamics 
can be expressed using Fisher information in place of the Shannon entropy 
- an essential ingredient for constructing a statistical mechanics (Frieden, 
Plastino, Plastino & Soffer 1999) - and it has been shown (Frieden, Plastino, 
Plastino & Soffer 2002) that both equilibrium and non-equilibrium statistical 
mechanics can be obtained from Eq. the output of a constrained process 
that extremizes Fisher information. 

The elegant and powerful Lagrangian approach to physics, in use for 
over 200 years, has until a recent application of the Fisher information ap- 
proach (Frieden & Soffer 1995, Frieden 1998) not had a formal method for 
constructing Lagrangians. This formalism for deriving Lagrangians and prob- 
ability laws that are consistent with observations should prove to be a very 
useful tool in finance and economics. 
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Appendix A 

To generalize the amplitude-matching that we used in Sec. 13.11 we use a 
transfer matrix method that is employed often for formally similar problems 
in quantum electronics (Yeh, Yariv & Hong 1977) 7 . Given a set of discount 
factors {D(T m )} we seek a solution to 



d 2 q(t) = q(t) 
dt 2 4 



M 



A + ^A m 9(t-T m ,0) 



m=l 



(A-l) 
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Between tenors, say T m _i and T m it is straightforward to show that the 
probability amplitude q(t) is given by 



q(t) = A m e 



iLO m (t—T m ) 



+ B m e 



-iuj m (t—T m ) 



(A-2) 



where A m and B m are coefficients to be determined by the matching condi- 



tions, i 



-1, and 



\ 



3=0 



(A-3) 



with {Xj} being the Lagrange undetermined multipliers. We now consider 
the match across the m th tenor. Since t — T m = 0, continuity of q(t) gives 



M 



A L + B L = A m+ i + B 



m+l 



(A-4) 



where the subscript L denotes the value of the coefficients of q(t) immediately 
to the left of T m . Continuity of the first derivative of q{t) gives 



where A r 



Al — B L — A m \A m+ x + B m+ i) , 
LU m+ x/u m . These two equations combine to give 



" A L ' 


1 




~ 2 



1 + A r 
1 - A r 



A r 



Am+l 

B m +i 



(A-5) 



(A-6) 



Tm+l — 



Since A m = A L exp(-iu) m T m ) and B m = A L exp(iuj m T m ) where t„ : 
T m it follows that the coefficients for q(t) in layer m are related to those in 
layer m + 1 by 



A 


1 


B m 


~ 2 











1 + A„ 
1 - A„ 



A r 
A r 



A m +i 
B m +\ 



(A-7) 



Given this relationship we can generate the function q(t) sequentially, be- 
ginning with the region beyond the longest tenor that we shall take to be 
T m+ i. In this region we require that the solution be a decaying exponen- 
tial exp(— u m+ it) which implies that B m+ % = and that u m+ i be a positive 
real number. Given a {A m } and setting A m+ i = 1 , it is now a straightfor- 
ward matter to calculate the fixed income functions described in Sec. 13.11 
This transfer matrix procedure maps {A m } into a discount function and, in 
a straightforward manner, into coupon bond prices. 
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Appendix B 



Our choice of the Numerov method to solve Eq.l2"31follows from the popularity 
of this method for solving this equation in quantum mechanics (Koonin 1986). 
In this appendix we present a brief overview of the method and it's applica- 
tion to the option problem discussed in Section IH.HI 

The Numerov method provides a numerical solution to 8 

f^l + G(x)q(x) = , (B-l) 
that begins by expanding q(x) in a Taylor series 

q(x±Ax) = T <±M!^(£). (b-2) 

v ' ^ n\ dx n v ' 

n=0 

Combining these two equations and retaining terms to fourth order yields 

q(x + Ax) + q(x - Ax) - 2q(x) _ d 2 q(x) (Ax) 2 d 4 q(x) 4 

(Ax) 2 ~ dx 2 + 12 dx 4 + U j j ' 

(B-3) 

The second derivative term in Eq. lB-3l can be cleared by substituting Eq. lB-ll 
and the fourth derivative term can be cleared by differentiating Eq. IB- II two 
more times 

d A q(x) = d 2 (G(x)q(x)) ^ 

dx 4 dx 2 

^ G(x + Ax)q(x + Ax) + G(x - Ax)q(x - Ax) - 2G(x)q(x) 

(AxJ 2 ' 

and substituting this result into Eq. IB- II Performing these substitutions and 
rearranging terms we arrive at the Numerov algorithm: 

9 ( i m*? r \ f i _i_ (Ax-) 2 r 



®+* = / , A , 2 f ?» - 7 7T- 2 f ffn+i + 0((Ax) b ) . (B-5) 

(Ax) 2 \ / . . (Ax) 2 n 
H J2 - <-Jn+l I l-M J2 - ^n+l 

To apply this to the option problem it is useful to note that the calculation 
can be separated into two stages: first, the calculation of the {A , q(x)} pair 
that solves the eigenvalue problem for a fixed set of {A m } m>0 , and second, 
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the search for the set {A m } m>0 for which q(x) of the corresponding {A , q(x)} 
pair reproduces the observed option prices. 

The {Xo,q(x)} pair for a given {A m } m> o is obtained by integrating q(x) 
from both the lower and upper limits of the x range and varying Ao until the 
logarithmic derivatives of each solution match at an arbitrary mid point. In 
this example that point was x — 1 and Ao was found with a 1-dimensional 
minimization algorithm using the difference in the logarithmic derivatives as a 
penalty function and the minimum of the function Ylm=i max(i-fc m , 0)/4 
as the initial guess for Ao- 

Once the logarithmic derivatives are matched the option prices can be 
calculated and compared with the observed option prices. The results of this 
comparison form the penalty function for a multidimensional variation of the 
{A m } m >o to obtain agreement between the observed and calculated option 
prices. 



Notes 

1 9(x) = if x < and 1 otherwise. 

2 These particular parameters were chosen so that the trigonometric functions resolve 
into simple fractions. 

3 See, for example, Section 22 of (Landau & Lifshitz 1977). 

4 A similar pairing on the minimization of d 2 f(t)/dt 2 is seen in the work of Adams & 
Van Deventer (1994) and the recent entropy work of Edelman (2001). 

5 In this paper we focus solely on the equilibrium solutions: those densities that cor- 
respond to the lowest value of Ao- Equation l2"4l is solved by a collection of {Xo,q(x)} 
pairs; the solutions q(x) corresponding to other Ao being nonequilibrium solutions. These 
solutions are the subject of a forthcoming paper. 

6 The lognormal distribution is 

P(S T ,T\S t ,t) = 1 fjHS T /S t )-r { T-t)f\ 

V ' ; S T y/2n<r 2 (T-t) \ 2a 2 (T-t) J 

from which the Black-Scholes call price 

c(k) = S t N(d 1 ) - ke- r{T -^N{d2) 
ln(S t /k) + {r + a 2 /2)(T-t) 



di 



£?2 = di — <j\/T — t 

can be derived. 

7 Our presentation of this derivation was derived, in part, from unpublished optoelec- 
tronics notes of Prof. David A.B. Miller. 
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8 The Numerov method can, in fact, be used to solve numerically the somewhat more 
general equation 

f^-+G(x)q(x)=S(x). 
The development of the algorithm follows the same line as that illustrated in this Appendix. 
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